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SEQUENCE LISTING 

<110> DSM IP Assets B.V. 

<120> Feedback- resistant mevalonate kinases 

<130> Case 21788 

<150> EP 03012294.9 
<151> 2003-06-12 

<160> 33 

<170> Patentin version 3.2 

<210> 1 
<211> 378 
<212> PRT 

<213> Paracoccus zeaxanthinifaciens 
<400> 1 

Met Ser Thr Gly Arg Pro Glu Ala Gly Ala His Ala Pro Gly Lys Leu 
15 10 15 

lie Leu Ser Gly Glu His Ser Val Leu Tyr Gly Ala Pro Ala Leu Ala 

20 25 30 

Met Ala lie Ala Arg Tyr Thr Glu val Trp Phe Thr Pro Leu Gly lie 
35 40 45 

Gly Glu Gly lie Arg Thr Thr Phe Ala Asn Leu Ser Gly Gly Ala Thr 
50 55 60 

Tyr Ser Leu Lys Leu Leu Ser Gly Phe Lys Ser Arg Leu Asp Arg Arg 
65 70 75 80 

Phe Glu Gin Phe Leu Asn Gly Asp Leu Lys val His Lys Val Leu Thr 

85 90 95 

His Pro Asp Asp Leu Ala Val Tyr Ala Leu Ala Ser Leu Leu His Asp 

100 105 110 

Lys Pro Pro Gly Thr Ala Ala Met Pro Gly lie Gly Ala Met His His 
115 120 125 

Leu Pro Arg Pro Gly Glu Leu Gly Ser Arg Thr Glu Leu Pro lie Gly 
130 135 140 

Ala Gly Met Gly Ser Ser Ala Ala lie Val Ala Ala Thr Thr Val Leu 
145 150 155 160 



Phe Glu Thr Leu Leu Asp Arg Pro Lys Thr Pro Glu Gin Arg Phe Asp 
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165 170 175 

Arg val Arg Phe Cys Glu Arg Leu Lys His Gly Lys Ala Gly Pro lie 

180 185 190 

Asp Ala Ala Ser val Val Arg Gly Gly Leu Val Arg val Gly Gly Asn 
195 200 205 



Gly Pro Gly Ser lie Ser Ser Phe Asp Leu Pro Glu Asp His Asp Leu 
210 215 220 

val Ala Gly Arg Gly Trp Tyr Trp val Leu His Gly Arg Pro Val Ser 
225 230 235 240 

Gly Thr Gly Glu Cys val Ser Ala val Ala Ala Ala His Gly Arg Asp 

245 250 255 

Ala Ala Leu Trp Asp Ala Phe Ala val cys Thr Arg Ala Leu Glu Ala 

260 265 270 

Ala Leu Leu Ser Gly Gly Ser Pro Asp Ala Ala lie Thr Glu Asn Gin 
275 280 285 

Arg Leu Leu Glu Arg lie Gly val val Pro Ala Ala Thr Gin Ala Leu 
290 295 300 

val Ala Gin lie Glu Glu Ala Gly Gly Ala Ala Lys lie Cys Gly Ala 
305 310 315 320 

Gly Ser val Arg Gly Asp His Gly Gly Ala val Leu val Arg lie Asp 

325 330 335 

Asp Ala Gin Ala Met Ala Ser val Met Ala Arg His Pro Asp Leu Asp 

340 345 350 

Trp Ala Pro Leu Arg Met Ser Arg Thr Gly Ala Ala Pro Gly Pro Ala 
355 360 365 

Pro Arg Ala Gin Pro Leu Pro Gly Gin Gly 
370 375 



<210> 2 

<211> 396 

<212> PRT 

<213> homo sapiens 

<400> 2 

Met Leu Ser Glu Val Leu Leu val Ser Ala Pro Gly Lys val lie Leu 
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10 



15 



His Gly Glu His Ala val val His Gly Lys val Ala Leu Ala val Ser 

20 25 30 

Leu Asn Leu Arg Thr Phe Leu Arg Leu Gin Pro His Ser Asn Gly Lys 
35 40 45 

val Asp Leu Ser Leu Pro Asn lie Gly lie Lys Arg Ala Trp Asp Val 
50 55 60 

Ala Arg Leu Gin ser Leu Asp Thr Ser Phe Leu Glu Gin Gly Asp Val 
65 70 75 80 



Thr Thr Pro Thr Ser Glu Gin val Glu Lys Leu Lys Glu val Ala Gly 

85 90 95 

Leu Pro Asp Asp Cys Ala val Thr Glu Arg Leu Ala val Leu Ala Phe 

100 105 110 

Leu Tyr Leu Tyr Leu ser lie Cys Arg Lys Gin Arg Ala Leu Pro Ser 
115 120 125 

Leu Asp He Val val Trp Ser Glu Leu Pro Pro Gly Ala Gly Leu Gly 
130 135 140 

Ser Ser Ala Ala Tyr Ser val Cys Leu Ala Ala Ala Leu Leu Thr val 
145 150 155 160 

Cys Glu Glu lie Pro Asn Pro Leu Lys Asp Gly Asp Cys Val Asn Arg 

165 170 175 

Trp Thr Lys Glu Asp Leu Glu Leu lie Asn Lys Trp Ala Phe Gin Gly 

180 185 190 

Glu Arg Met lie His Gly Asn Pro Ser Gly val Asp Asn Ala val Ser 
195 200 205 

Thr Trp Gly Gly Ala Leu Arg Tyr His Gin Gly Lys lie Ser Ser Leu 
210 215 220 

Lys Arg ser Pro Ala Leu Gin lie Leu Leu Thr Asn Thr Lys Val Pro 
225 230 235 240 

Arg Asn Thr Arg Ala Leu Val Ala Gly val Arg Asn Arg Leu Leu Lys 

245 250 ~ 255 
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Phe Pro Glu lie val Ala Pro Leu Leu Thr Ser lie Asp Ala lie Ser 

260 265 270 

Leu Glu Cys Glu Arg val Leu Gly Glu Met Gly Glu Ala Pro Ala Pro 
275 280 285 

Glu Gin Tyr Leu val Leu Glu Glu Leu lie Asp Met Asn Gin His His 
290 295 300 

Leu Asn Ala Leu Gly Val Gly His Ala Ser Leu Asp Gin Leu Cys Gin 
305 310 315 320 

val Thr Arg Ala Arg Gly Leu His Ser Lys Leu Thr Gly Ala Gly Gly 

325 330 335 

Gly Gly Cys Gly lie Thr Leu Leu Lys Pro Gly Leu Glu Gin Pro Glu 

340 345 350 

val Glu Ala Thr Lys Gin Ala Leu Thr Ser cys Gly Phe Asp Cys Leu 
355 360 365 

Glu Thr Ser lie Gly Ala Pro Gly val Ser lie His Ser Ala Thr Ser 
370 375 380 

Leu Asp Ser Arg val Gin Gin Ala Leu Asp Gly Leu 
385 ~ 390 395 



<210> 3 

<211> 395 

<212> PRT 

<213> mouse 

<400> 3 

Met Leu Ser Glu Ala Leu Leu val Ser Ala Pro Gly Lys val lie Leu 
15 10 15 

» 

His Gly Glu His Ala val val His Gly Lys val Ala Leu Ala Ala Ala 

20 25 30 

Leu Asn Leu Arg Thr Phe Leu Leu Leu Arg Pro Gin Ser Asn Gly Lys 
35 40 45 

val Ser val Asn Leu Pro Asn lie Gly lie Lys Gin val Trp Asp val 
50 55 60 

* 

Gly Met Leu Gin Arg Leu Asp Thr Ser Phe Leu Glu Gin Gly Asp Val 
65 70 75 80 
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Ser Val Pro Thr Leu Glu Gin Leu Glu Lys Leu Lys Lys Met Gly Asp 

85 90 95 

Leu Pro Arg Asp Arg Ala Gly Asn Glu Gly Met Ala Leu Leu Ala Phe 

100 105 110 

Leu Tyr Leu Tyr Leu Ala lie Cys Arg Lys Gin Arg Thr Leu Pro ser 

115 120 125 



Leu Asp Met val val Trp Ser Glu Leu Pro Pro Gly Ala Gly Leu Gly 
130 135 140 

Ser Ser Ala Ala Tyr Ser val Cys Leu Ala Ala Ala Leu Leu Thr Ala 
145 150 155 160 

Cys Glu Glu val Ser Asn Pro Leu Lys Asp Gly val Ser val Ser Arg 

165 170 175 

Trp Pro Glu Glu Asp Leu Lys Ser lie Asn Lys Trp Ala Phe Glu Gly 

180 185 190 

Glu Arg val lie His Gly Asn Pro Ser Gly val Asp Asn Ala val Ser 
195 200 205 

Thr Trp Gly Gly Ala Leu Arg Phe Gin Gin Gly Thr Met Ser Ser Leu 
210 215 220 

Lys Ser Leu Pro Ser Leu Gin lie Leu Leu Thr Asn Thr Lys val Pro 
225 230 235 240 

Arg ser Thr Lys Ala Leu Val Ala Ala Val Arg Ser Arg Leu Thr Lys 

245 250 255 

Phe Pro Glu lie val Ala Pro Leu Leu Thr Ser lie Asp Ala lie Ser 

260 265 270 

Leu Glu Cys Glu Arg val Leu Gly Glu Met val Ala Ala Pro val Pro 
275 280 285 

Glu Gin Tyr Leu val Leu Glu Glu Leu lie Asp Met Asn Gin His His 
290 295 300 

Leu Asn Ala Leu Gly val Gly His Asn ser Leu Asp Gin Leu Cys Gin 
305 310 315 320 

val Thr Ala Ala His Gly Leu His Ser Lys Leu Thr Gly Ala Gly Gly 

325 330 335 
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Gly Gly Cys Gly lie Thr Leu Leu Lys Pro Gly Leu Glu Gin Ala Thr 

340 345 350 

Val Glu Ala Ala Lys Gin Ala Leu Thr Ser Cys Gly Phe Asp Cys Trp 
355 360 365 



Glu Thr Ser lie Gly Ala Pro Gly Val Ser Thr His Ser Ala Ala Ala 
370 375 380 

val Gly Asp Pro Val Arg Gin Ala Leu Gly Leu 
385 390 395 



<210> 4 

<211> 395 

<212> PRT 

<213> rat 

<400> 4 

Met Leu Ser Glu val Leu Leu Val Ser Ala Pro Gly Lys val lie Leu 
15 10 15 

His Gly Glu His Ala val val His Gly Lys val Ala Leu Ala Val Ala 

20 25 30 

Leu Asn Leu Arg Thr Phe Leu val Leu Arg Pro Gin Ser Asn Gly Lys 
35 40 45 

val Ser Leu Asn Leu Pro Asn val Gly lie Lys Gin val Trp Asp val 
50 55 60 

Ala Thr Leu Gin Leu Leu Asp Thr Gly Phe Leu Glu Gin Gly Asp val 
65 70 75 80 

Pro Ala Pro Thr Leu Glu Gin Leu Glu Lys Leu Lys Lys val Ala Gly 

85 90 95 

Leu Pro Arg Asp Cys val Gly Asn Glu Gly Leu Ser Leu Leu Ala Phe 

100 105 110 

Leu Tyr Leu Tyr Leu Ala lie Cys Arg Lys Gin Arg Thr Leu Pro Ser 
115 120 125 

Leu Asp lie Met Val Trp ser Glu Leu Pro Pro Gly Ala Gly Leu Gly 
130 135 140 

Ser ser Ala Ala Tyr ser val Cys Val Ala Ala Ala Leu Leu Thr Ala 
145 150 155 160 
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Cys Glu Glu Val Thr Asn Pro Leu Lys Asp Arg Gly Ser lie Gly Ser 

165 170 175 

Trp Pro Glu Glu Asp Leu Lys Ser lie Asn Lys Trp Ala Tyr Glu Gly 

180 185 190 



Glu Arg val lie His Gly Asn Pro Ser Gly val Asp Asn Ser Val Ser 
195 200 205 



Thr Trp Gly Gly Ala Leu Arg Tyr Gin Gin Gly Lys Met Ser Ser Leu 
210 215 220 



Lys Arg Leu Pro Ala Leu Gin lie Leu Leu Thr Asn Thr Lys val Pro 
225 230 235 240 

Arg ser Thr Lys Ala Leu val Ala Gly Val Arg Ser Arg Leu lie Lys 

245 250 255 

Phe Pro Glu lie Met Ala Pro Leu Leu Thr Ser lie Asp Ala lie ser 

260 265 270 

Leu Glu Cys Glu Arg val Leu Gly Glu Met Ala Ala Ala Pro val Pro 
275 280 285 

Glu Gin Tyr Leu val Leu Glu Glu Leu Met Asp Met Asn Gin His His 
290 295 300 

Leu Asn Ala Leu Gly val Gly His Ala ser Leu Asp Gin Leu Cys Gin 
305 310 315 320 

val Thr Ala Ala His Gly Leu His Ser Lys Leu Thr Gly Ala Gly Gly 

325 330 335 

Gly Gly Cys Gly lie Thr Leu Leu Lys Pro Gly Leu Glu Arg Ala Lys 

340 345 350 

val Glu Ala Ala Lys Gin Ala Leu Thr Gly Cys Gly Phe Asp Cys Trp 
355 360 365 

Glu Thr Ser lie Gly Ala Pro Gly val ser Met His ser Ala Thr ser 
370 375 380 

lie Glu Asp Pro val Arg Gin Ala Leu Gly Leu 
385 390 395 

<210> 5 
<211> 378 
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<212> PRT 

<213> Arabidopsis thai i ana 
<400> 5 

Met Glu val Lys Ala Arg Ala Pro Gly Lys lie lie Leu Ala Gly Glu 
15 10 15 

His Ala val val His Gly Ser Thr Ala val Ala Ala Ala lie Asp Leu 

20 25 30 

Tyr Thr Tyr val Thr Leu Arg Phe Pro Leu Pro Ser Ala Glu Asn Asn 
35 40 45 

Asp Arg Leu Thr Leu Gin Leu Lys Asp lie Ser Leu Glu Phe Ser Trp 
50 55 60 

Ser Leu Ala Arg lie Lys Glu Ala lie Pro Tyr Asp Ser Ser Thr Leu 
65 70 75 80 

Cys Arg Ser Thr Pro Ala Ser Cys Ser Glu Glu Thr Leu Lys Ser He 

85 90 95 

Ala Val Leu val Glu Glu Gin Asn Leu Pro Lys Glu Lys Met Trp Leu 

100 105 110 

Ser Ser Gly lie Ser Thr Phe Leu Trp Leu Tyr Thr Arg lie lie Gly 
115 120 125 

Phe Asn Pro Ala Thr val Val lie Asn Ser Glu Leu Pro Tyr Gly Ser 
130 135 140 

Gly Leu Gly Ser Ser Ala Ala Leu Cys Val Ala Leu Thr Ala Ala Leu 
145 150 155 160 

Leu Ala Ser ser lie Ser Glu Lys Thr Arg Gly Asn Gly Trp Ser ser 

165 170 175 

Leu Asp Glu Thr Asn Leu Glu Leu Leu Asn Lys Trp Ala Phe Glu Gly 

180 185 190 

Glu Lys lie lie His Gly Lys Pro Ser Gly lie Asp Asn Thr val Ser 
195 200 205 

Ala Tyr Gly Asn Met lie Lys Phe Cys Ser Gly Glu lie Thr Arg Leu 
210 215 220 

Gin Ser Asn Met Pro Leu Arg Met Leu lie Thr Asn Thr Arg val Gly 
225 230 235 240 
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Arg Asn Thr Lys Ala Leu val ser 



Gly 



Val 
250 



Ser 



Gin 



Arg 



Ala 



val Arg 
255 



245 



His Pro Asp Ala Met Lys Ser Val Phe Asn Ala Val Asp Ser lie Ser 

260 265 270 



Lys Glu Leu Ala Ala lie lie Gin Ser Lys Asp Glu Thr Ser val Thr 
275 280 285 



Glu Lys Glu Glu Arg lie Lys Glu Leu Met Glu Met Asn Gin Gly Leu 
290 295 300 



Leu Leu Ser Met Gly val Ser His Ser Ser lie Glu Ala val lie Leu 
305 310 315 320 



Thr Thr val Lys His Lys Leu val Ser Lys Leu Thr Gly Ala Gly Gly 

325 330 335 



Gly Gly Cys Val Leu Thr Leu Leu Pro Thr Gly Thr val val Asp Lys 

340 345 350 



Val val Glu Glu Leu Glu Ser Ser Gly Phe Gin Cys Phe Thr Ala Leu 
355 360 365 



lie Gly Gly Asn Gly Ala Gin lie Cys Tyr 
370 375 



<210> 6 
<211> 443 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 6 

Met Ser Leu Pro Phe Leu Thr Ser Ala Pro Gly Lys Val lie He Phe 
15 10 15 



Gly Glu His Ser Ala val Tyr Asn Lys Pro Ala val Ala Ala Ser val 

20 25 30 



Ser Ala Leu Arg Thr Tyr Leu Leu lie Ser Glu ser ser Ala Pro Asp 
35 40 45 



Thr lie Glu Leu Asp Phe Pro Asp lie Ser Phe Asn His Lys Trp ser 
50 55 60 



lie Asn Asp Phe Asn Ala lie Thr Glu Asp Gin val Asn Ser Gin Lys 
65 70 75 80 
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Leu Ala Lys Ala Gin Gin Ala Thr Asp Gly Leu Ser Gin Glu Leu val 

85 90 95 

Ser Leu Leu Asp Pro Leu Leu Ala Gin Leu Ser Glu Ser Phe His Tyr 

100 105 110 

His Ala Ala Phe Cys Phe Leu Tyr Met Phe val Cys Leu Cys Pro His 
115 120 125 

Ala Lys Asn lie Lys Phe Ser Leu Lys Ser Thr Leu Pro lie Gly Ala 
130 135 140 

* 

Gly Leu Gly Ser Ser Ala Ser lie Ser val Ser Leu Ala Leu Ala Met 
145 150 155 160 

Ala Tyr Leu Gly Gly Leu lie Gly Ser Asn Asp Leu Glu Lys Leu Ser 

165 170 175 

Glu Asn Asp Lys His lie val Asn Gin Trp Ala Phe lie Gly Glu Lys 

180 185 190 

Cys lie His Gly Thr Pro Ser Gly lie Asp Asn Ala Val Ala Thr Tyr 
195 200 205 

Gly Asn Ala Leu Leu Phe Glu Lys Asp Ser His Asn Gly Thr lie Asn 
210 215 220 

Thr Asn Asn Phe Lys Phe Leu Asp Asp Phe Pro Ala lie Pro Met lie 
225 230 235 240 

Leu Thr Tyr Thr Arg lie Pro Arg Ser Thr Lys Asp Leu val Ala Arg 

245 250 255 

val Arg val Leu Val Thr Glu Lys Phe Pro Glu Val Met Lys Pro lie 

260 265 270 

Leu Asp Ala Met Gly Glu Cys Ala Leu Gin Gly Leu Glu lie Met Thr 
275 280 285 

Lys Leu Ser Lys Cys Lys Gly Thr Asp Asp Glu Ala val Glu Thr Asn 
290 295 300 

Asn Glu Leu Tyr Glu Gin Leu Leu Glu Leu lie Arg lie Asn His Gly 
305 310 315 320 

Leu Leu val ser He Gly val Ser His Pro Gly Leu Glu Leu He Lys 
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325 330 335 



Asn Leu Ser Asp Asp Leu Arg lie Gly Ser Thr Lys Leu Thr Gly Ala 

340 345 350 

Gly Gly Gly Gly Cys Ser Leu Thr Leu Leu Arg Arg Asp lie Thr Gin 
355 360 365 

Glu Gin lie Asp Ser Phe Lys Lys Lys Leu Gin Asp Asp Phe Ser Tyr 
370 375 380 

Glu Thr Phe Glu Thr Asp Leu Gly Gly Thr Gly Cys Cys Leu Leu Ser 
385 390 395 400 

Ala Lys Asn Leu Asn Lys Asp Leu Lys lie Lys Ser Leu Val Phe Gin 

405 410 415 

Leu Phe Glu Asn Lys Thr Thr Thr Lys Gin Gin ile Asp Asp Leu Leu 

420 425 430 

Leu Pro Gly Asn Thr Asn Leu Pro Trp Thr Ser 
435 440 



<210> 7 
<211> 404 
<212> PRT 

<213> Schizosaccharomyces pombe 
<400> 7 

Met Ser Lys Ser Leu Ile val Ser ser Pro Gly Lys Thr lie Leu Phe 
15 10 15 

Gly Glu His Ala val Val Tyr Gly Ala Thr Ala Leu Ala Ala Ala val 

20 25 30 

Ser Leu Arg Ser Tyr Cys Lys Leu Gin Thr Thr Asn Asn Asn Glu lie 
35 40 45 

val lie val Met Ser Asp lie Gly Thr Glu Arg Arg Trp Asn Leu Gin 
50 55 60 

Ser Leu Pro Trp Gin His val Thr val Glu Asn val Gin His Pro Ala 
65 70 75 80 

Ser Ser Pro Asn Leu Asp Leu Leu Gin Gly Leu Gly Glu Leu Leu Lys 

85 90 95 

Asn Glu Glu Asn Gly Leu Ile His Ser Ala Met Leu Cys Thr Leu Tyr 
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100 105 110 

Leu Phe Thr Ser Leu Ser Ser Pro Ser Gin Gly Cys Thr Leu Thr lie 
115 120 125 

Ser ser Gin val Pro Leu Gly Ala Gly Leu Gly Ser Ser Ala Thr lie 
130 135 140 



Ser val val val Ala Thr Ser Leu Leu Leu Ala Phe Gly Asn lie Glu 
145 150 155 160 

Pro Pro Ser Ser Asn Ser Leu Gin Asn Asn Lys Ala Leu Ala Leu lie 

165 170 175 

Glu Ala Trp ser Phe Leu Gly Glu Cys Cys lie His Gly Thr Pro Ser 

180 185 190 

Gly lie Asp Asn Ala val Ala Thr Asn Gly Gly Leu lie Ala Phe Arg 
195 200 205 

Lys Ala Thr Ala His Gin Ser Ala Met Lys Glu Phe Leu Lys Pro Lys 
210 215 220 

Asp Thr Leu Ser val Met lie Thr Asp Thr Lys Gin Pro Lys Ser Thr 
225 230 235 240 

Lys Lys Leu val Gin Gly val Phe Glu Leu Lys Glu Arg Leu Pro Thr 

245 250 255 

Val lie Asp Ser lie lie Asp Ala lie Asp Gly lie Ser Lys Ser Ala 

260 265 270 

val Leu Ala Leu Thr Ser Glu Ser Asp Lys Asn Ser Ser Ala Lys Lys 
275 280 285 

Leu Gly Glu Phe lie val Leu Asn Gin Lys Leu Leu Glu Cys Leu Gly 
290 295 300 

val Ser His Tyr Ser lie Asp Arg val Leu Gin Ala Thr Lys ser lie 
305 310 315 320 

Gly Trp Thr Lys Leu Thr Gly Ala Gly Gly Gly Gly Cys Thr lie Thr 

325 330 335 

Leu Leu Thr Pro Glu Cys Lys Glu Glu Glu Phe Lys Leu Cys Lys Glu 

340 345 350 
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Ser Leu Leu Ala His Lys Asn Ser lie Tyr Asp val Gin Leu Gly Gly 
355 360 365 

Pro Gly val Ser val Val Thr Asp Ser Asp Ser Phe Phe Pro Gin Tyr 
370 375 380 



Glu Ser Asp Phe Asp Phe Lys Lys Leu Asn Leu Leu Ser Lys Phe Asn 
385 390 395 400 



Lys Tyr Tyr lie 



<210> 8 
<211> 335 
<212> PRT 

<213> Pyrococcus abyssi 
<400> 8 

Met Pro Arg Leu val Leu Ala Ser Ala Pro Ala Lys lie lie Leu Phe 
15 10 15 

Gly Glu His Ser val val Tyr Gly Lys Pro Ala lie Ala Ser Ala lie 

20 25 30 

Asp Leu Arg Thr Tyr val Arg Ala Glu Phe Asn Asp Ser Gly Asn lie 
35 40 45 

Lys lie Glu Ala His Asp lie Lys Thr Pro Gly Leu lie Val Ser Phe 
50 55 60 

Ser Glu Asp Lys lie Tyr Phe Glu Thr Asp Tyr Gly Lys Ala Ala Glu 
65 70 75 80 

Val Leu Ser Tyr val Arg His Ala lie Glu Leu val Leu Glu Glu Ala 

85 90 95 

Asp Lys Arg Thr Gly val Ser Val Ser lie Thr ser Gin lie Pro val 

100 105 110 

Gly Ala Gly Leu Gly ser ser Ala Ala val Ala val Ala Thr lie Gly 
115 120 125 

Ala val Ser Lys Leu Leu Asp Leu Glu Leu Ser Lys Glu Glu He Ala 
130 135 140 

Lys Met Gly His Lys val Glu Leu Leu val Gin Gly Ala Ser ser Gly 
145 150 155 160 
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lie Asp Pro Thr val Ser Ala lie Gly Gly Phe Leu Tyr Tyr Lys Gin 

165 170 175 

Gly Glu Phe Glu His Leu Pro Phe val Glu Leu Pro lie val Val Gly 

180 185 190 

Tyr Thr Gly Ser ser Gly Ser Thr Lys Glu Leu val Ala Met Val Arg 
195 200 205 

Arg Arg Tyr Glu Glu Met Pro Glu Leu lie Glu Pro lie Leu Glu Ser 
210 215 220 

Met Gly Lys Leu Val Asp Lys Ala Lys Glu val lie lie Ser Lys Leu 
225 230 235 240 

Asp Glu Glu Glu Lys Phe Leu Lys Leu Gly Glu Leu Met Asn lie Asn 

245 250 255 

His Gly Leu Leu Asp Ala Leu Gly Val Ser Thr Lys Lys Leu ser Glu 

260 265 270 

Leu val Tyr Ala Ala Arg Thr Ala Gly Ala lie Gly Ala Lys Leu Thr 
275 280 285 

Gly Ala Gly Gly Gly Gly Cys Met Tyr Ala Leu Ala Pro Gly Lys Gin 
290 295 300 

Arg Glu val Ala Thr Ala lie Lys lie Ala Gly Gly Thr Pro Met lie 
305 310 315 320 

Thr Arg lie Ser Lys Glu Gly Leu Arg lie Glu Glu Val Arg Glu 

325 330 335 



<210> 9 

<211> 335 

<212> PRT 

<213> Pyrococcus horikoshii 

<400> 9 

Met val Lys Tyr val Leu Ala Ser Ala Pro Ala Lys val lie Leu Phe 
1 5 10 15 

Gly Glu His Ser val val Tyr Gly Lys Pro Ala lie Ala Ser Ala lie 

20 25 30 

Glu Leu Arg Thr Tyr Val Arg Ala Gin Phe Asn Asp Ser Gly Asn lie 
35 40 45 
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Lys lie Glu Ala His Asp lie Lys Thr Pro Gly Leu lie val Ser Phe 
50 55 60 

Ser Glu Asp Lys lie Tyr Phe Glu Thr Asp Tyr Gly Lys Ala Ala Glu 
65 70 75 80 

Val Leu Ser Tyr val Arg Tyr Ala lie Glu Leu Ala Leu Glu Glu Ser 

85 90 95 

Asp Lys Arg val Gly lie Asp Val Ser lie Thr Ser Gin lie Pro val 

100 105 110 

Gly Ala Gly Leu Gly Ser ser Ala Ala val Ala val Ala Thr He Gly 
115 120 125 

Ala val Ser Arg Leu Leu Gly Leu Glu Leu Ser Lys Glu Glu lie Ala 
130 135 140 

Lys Leu Gly His Lys Val Glu Leu Leu Val Gin Gly Ala Ser Ser Gly 
145 150 155 160 

lie Asp Pro Thr Val Ser Ala Val Gly Gly Phe Leu Tyr Tyr Lys Gin 

165 170 175 

Gly Lys Phe Glu Pro Leu Pro Phe Met Glu Leu Pro lie Val val Gly 

180 185 190 

Tyr Thr Gly Ser Thr Gly Ser Thr Lys Glu Leu Val Ala Met Val Arg 
195 200 205 

Lys Arg Tyr Glu Glu Met Pro Glu Leu Val Glu Pro lie Leu Glu Ala 
210 215 220 

Met Gly Lys Leu val Asp Lys Ala Lys Glu lie lie Leu Ser Lys Leu 
225 230 235 240 

Asp Glu Glu Glu Lys Leu Thr Lys Leu Gly Glu Leu Met Asn lie Asn 

245 250 255 

His Gly Leu Leu Asp Ala Leu Gly val Ser Thr Lys Lys Leu Gly Glu 

260 265 270 

Leu val Tyr Ala Ala Arg Thr Ala Gly Ala lie Gly Ala Lys Leu Thr 
275 280 285 

Gly Ala Gly Gly Gly Gly Cys Met Tyr Ala Leu Ala Pro Gly Arg Gin 
290 295 300 
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Arg Glu val Ala Thr Ala lie Lys lie Ala Gly Gly lie Pro Met lie 
305 310 315 320 

Thr Arg val Ser Arg Glu Gly Leu Arg lie Glu Glu Val Ser Arg 

325 330 335 



<210> 10 
<211> 334 
<212> PRT 

<213> Pyrococcus furiosus 
<400> 10 

Met Lys val lie Ala Ser Ala Pro Ala Lys val lie Leu Phe Gly Glu 
15 10 15 

His Ser val val Tyr Gly Lys Pro Ala lie Ala Ala Ala lie Asp Leu 

20 25 30 

Arg Thr Phe val Glu Ala Glu Leu lie Arg Glu Lys Lys lie Arg lie 
35 40 45 

Glu Ala His Asp lie Lys val Pro Gly Leu Thr val ser Phe Ser Glu 
50 55 60 

Asn Glu lie Tyr Phe Glu Thr Asp Tyr Gly Lys Ala Ala Glu val Leu 
65 70 75 80 

Ser Tyr val Arg Glu Ala lie Asn Leu val Leu Glu Glu Ala Asp Lys 

85 90 95 

Lys Asn val Gly lie Lys Val Ser lie Thr Ser Gin lie Pro val Gly 

100 105 110 

Ala Gly Leu Gly Ser Ser Ala Ala val Ala Val Ala Thr lie Gly Ala 
115 120 125 

val Ser Lys Leu Leu Gly Leu Glu Leu Ser Lys Glu Glu lie Ala Lys 
130 135 140 

Met Gly His Lys Thr Glu Leu Leu val Gin Gly Ala Ser Ser Gly lie 
145 150 155 160 

Asp Pro Thr val Ser Ala lie Gly Gly Phe lie Phe Tyr Glu Lys Gly 

165 170 175 

Lys Phe Glu His Leu Pro Phe Met Glu Leu Pro lie val Val Gly Tyr 

180 185 190 
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Thr Gly Ser Ser Gly Pro Thr Lys Glu Leu Val Ala Met val Arg Lys 
195 200 205 

Arg Tyr Glu Glu Met Pro Glu Leu lie Val Pro lie Leu Glu Ala Met 
210 215 220 

Gly Lys val val Glu Lys Ala Lys Asp Val lie Leu Ser Asn val Asp 
225 230 235 240 

Lys Glu Glu Lys Phe Glu Arg Leu Gly Val Leu Met Asn lie Asn His 

245 250 255 

Gly Leu Leu Asp Ala Leu Gly val Ser Thr Lys Lys Leu Ser Glu Leu 

260 265 270 

val Tyr Ala Ala Arg Val Ala Gly Ala Leu Gly Ala Lys lie Thr Gly 
275 280 285 

Ala Gly Gly Gly Gly Cys Met Tyr Ala Leu Ala Pro Asn Lys Gin Arg 
290 295 300 

Glu val Ala Thr Ala lie Arg lie Ala Gly Gly Thr Pro Met lie Thr 
305 310 315 320 

Glu lie Ser Arg Glu Gly Leu Lys lie Glu Glu Val lie Lys 

325 330 

<210> 11 
<211> 303 
<212> PRT 

<213> Methanobacterium thermoautotrophicum 
<400> 11 

Met Lys Ser Ser Ala Ser Ala Pro Ala Lys Ala lie Leu Phe Gly Glu 
15 10 15 

His Ala val val Tyr ser Lys Pro Ala lie Ala Ala Ala lie Asp Arg 

20 25 30 

Arg val Thr val Thr val Ser Glu Ser ser Ser Thr His val Thr lie 
35 40 45 

Pro Ser Leu Gly lie Arg His Ser Ser Glu Arg Pro Ser Gly Gly lie 
50 55 60 

Leu Asp Tyr lie Gly Arg cys Leu Glu Leu Tyr His Asp Ala Ser Pro 
65 70 75 80 
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Leu Asp lie Arg val Glu Met Glu lie Pro Ala Gly Ser Gly Leu Gly 

85 90 95 

Ser Ser Ala Ala Leu Thr Val Ala Leu lie Gly Ala Leu Asp Arg Tyr 

100 105 110 

His Gly Arg Asp His Gly Pro Gly Glu Thr Ala Ala Arg Ala His Arg 
115 120 125 

val Glu val Asp val Gin Gly Ala Ala Ser Pro Leu Asp Thr Ala lie 
130 135 140 

Ser Thr Tyr Gly Gly Leu val Tyr Leu Asp Ser Gin Arg Arg val Arg 
145 150 155 160 

Gin Phe Glu Ala Asp Leu Gly Asp Leu val lie Ala His Leu Asp Tyr 

165 170 175 

Ser Gly Glu Thr Ala Arg Met Val Ala Gly val Ala Glu Arg Phe Arg 

180 185 190 

Arg Phe Pro Asp lie Met Gly Arg lie Met Asp Thr Val Glu Ser lie 
195 200 205 

Thr Asn Thr Ala Tyr Arg Glu Leu Leu Arg Asn Asn Thr Glu Pro Leu 
210 215 220 

Gly Glu Leu Met Asn Leu Asn Gin Gly Leu Leu Asp Ser Met Gly val 
225 230 235 240 

Ser Thr Arg Glu Leu Ser Met Met val Tyr Glu Ala Arg Asn Ala Gly 

245 250 255 

Ala Ala Gly Ser Lys lie Thr Gly Ala Gly Gly Gly Gly Ser lie lie 

260 265 270 

Ala His Cys Pro Gly cys val Asp Asp Val Val Thr Ala Leu Asn Arg 
275 280 285 

Asn Trp Lys Ala Met Arg Ala Glu Phe Ser val Lys Gly Leu lie 
290 295 300 

<210> 12 
<211> 284 
<212> PRT 

<213> Archaeoglobus fulgidus 
<400> 12 
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Met lie Ala Ser Ala Pro Gly Lys lie lie Leu Phe Gly Glu His Ala 
15 10 15 

val val Tyr Gly Arg His Ala Val val Ser Ala lie Asn Leu Arg Cys 

20 25 30 

Arg val ser val Arg Lys Ser Asp Arg Phe Leu lie Arg Ser Ser Leu 
35 40 45 

Gly Glu Ser Gly Leu Asp Tyr Gin Arg His Pro Tyr val val Gin Ala 
50 55 60 

Val Lys Arg Phe Gly Glu Leu Arg Asn lie Pro Gly Ala Glu lie Glu 
65 70 75 80 

lie Glu Ser Glu lie Pro lie Gly Ser Gly Leu Gly Ser Ser Ala Ala 

85 90 95 

val lie val Ala Thr lie Ala Ala Leu Asn Ala Glu Phe Asp Gly Asp 

100 105 110 

Met Asp Lys Glu Ala lie Phe Gin Met Ala Lys Gin val Glu Tie Asp 
115 120 125 

val Gin Gly Arg Ala Ser Gly lie Asp Pro Phe lie Ser Thr Phe Gly 
130 135 140 

Gly Ser Trp Leu Phe Pro Glu Arg Arg Lys Val Glu Met Pro Phe Lys 
145 150 155 160 

Phe Phe val lie Asn Phe Gly ser Arg ser Thr Ala Glu Met Val Ala 

165 170 175 

Lys val Ala Glu Leu Arg Glu Arg His Pro Glu Val Val Asp Lys lie 

180 185 190 

Phe Asp Ala lie Asp Ala lie Ser Leu Glu Ala Ser Asp Val Gly Ser 
195 200 205 

Ala Glu Arg Leu Glu Glu Leu He Ala lie Asn Gin Ser Leu Leu Arg 
210 215 220 

Ala lie Gly val Ser Asn Pro Glu lie Asp Arg Thr lie Ala Glu Leu 
225 230 235 240 

Glu Arg Met Gly Leu Asn Ala Lys lie Thr Gly Ala Gly Gly Gly Gly 

245 250 255 
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Cys lie Phe Gly Leu Phe Lys Gly Glu Lys Pro Lys Gly Ser Phe lie 

260 265 270 

val Glu Pro Glu Lys Glu Gly val Arg lie Glu Glu 
275 280 



<210> 13 
<211> 312 
<212> PRT 

<213> Methanococcus jannaschii 
<400> 13 

Met lie lie Glu Thr Pro Ser Lys Val lie Leu Phe Gly Glu His Ala 
15 10 15 

val Val Tyr Gly Tyr Arg Ala lie Ser Met Ala lie Asp Leu Thr Ser 

20 25 30 v 

Thr lie Glu lie Lys Glu Thr Gin Glu Asp Glu lie lie Leu Asn Leu 
35 40 45 

Asn Asp Leu Asn Lys Ser Leu Gly Leu Asn Leu Asn Glu lie Lys Asn 
50 55 60 

lie Asn Pro Asn Asn Phe Gly Asp Phe Lys Tyr Cys Leu Cys Ala lie 
65 70 75 80 

Lys Asn Thr Leu Asp Tyr Leu Asn lie Glu Pro Lys Thr Gly Phe Lys 

85 90 95 

lie Asn lie Ser Ser Lys lie Pro lie Ser Cys Gly Leu Gly Ser ser 

100 105 110 

Ala Ser lie Thr He Gly Thr lie Lys Ala val ser Gly Phe Tyr Asn 
115 120 125 

Lys Glu Leu Lys Asp Asp Glu lie Ala Lys Leu Gly Tyr Met val Glu 
130 135 140 

Lys Glu lie Gin Gly Lys Ala Ser lie Thr Asp Thr Ser Thr lie Thr 
145 150 155 160 

Tyr Lys Gly lie Leu Glu lie Lys Asn Asn Lys Phe Arg Lys lie Lys 

165 170 175 

Gly Glu Phe Glu Glu Phe Leu Lys Asn Cys Lys Phe Leu lie Val Tyr 

180 185 190 

Page 20 



21788wo.ST25.txt 

Ala Glu Lys Arg Lys Lys Lys Thr Ala Glu Leu Val Asn Glu val Ala 
195 200 205 

Lys lie Glu Asn Lys Asp Glu lie Phe Lys Glu lie Asp Lys val lie 
210 215 220 



Asp Glu Ala Leu Lys lie Lys Asn Lys Glu Asp Phe Gly Lys Leu Met 
225 230 235 240 

Thr Lys Asn His Glu Leu Leu Lys Lys Leu Asn lie Ser Thr Pro Lys 

245 250 255 

Leu Asp Arg He val Asp lie Gly Asn Arg Phe Gly Phe Gly Ala Lys 

260 265 270 

Leu Thr Gly Ala Gly Gly Gly Gly Cys Val lie lie Leu Val Asn Glu 
275 280 285 

Glu Lys Glu Lys Glu Leu Leu Lys Glu Leu Asn Lys Glu Asp val Arg 
290 295 300 

lie Phe Asn cys Arg Met Met Asn 
305 310 



<210> 14 

<211> 324 

<212> PRT 

<213> Aeropyrum pernix 

<400> 14 

Met Arg Arg Ala Ala Arg Ala Ser Ala Pro Gly Lys Val lie lie Val 
1 5 10 15 

Gly Glu His Phe Val val Arg Gly ser Leu Ala lie val Ala Ala lie 

20 25 30 

Gly Arg Arg Leu Arg val Thr val Arg ser Gly Gly Lys Gly lie val 
35 40 45 

Leu Glu Ser Ser Met Leu Gly Arg His Ser Ala Pro Leu pro Gly Gin 
50 55 60 

Gly Ala Ala Ala Lys val Ser Pro val Leu Glu Pro Tyr lie Ala val 
65 70 75 80 

Leu Arg ser Leu Ala Ala Arg Gly Tyr ser val val Pro His Thr lie 

85 90 95 
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Leu Val Glu Ser Gly lie Pro Pro Arg Ala Gly Leu Gly Ser Ser Ala 

100 105 110 

Ala Ser Met val Ala Tyr Ala Leu Ser Tyr ser Ala Met His Gly Asp 
115 120 125 

Pro Leu Ser Ala Glu Asp Leu Tyr ser Val Ala Met Glu Gly Glu Lys 
130 135 140 

lie Ala His Gly Lys Pro Ser Gly val Asp val Thr lie Ala val Arg 
145 150 155 160 

Gly Gly Val Leu Ala Tyr Arg Arg Gly Glu Asn Pro Val Asp lie Arg 

165 170 175 

Pro Gly Leu Thr Gly val Thr Leu Leu val Ala Asp Thr Gly val Glu 

180 185 190 

Arg Arg Thr Arg Asp Val Val Glu His Val Leu Ser lie Ala Asp Ala 
195 200 205 

Leu Gly Glu Ala Ser Thr Tyr lie Tyr Arg Ala Ala Asp Leu lie Ala 
210 215 220 

Arg Glu Ala Leu His Ala lie Glu Lys Gly Asp Ala Glu Arg Leu Gly 
225 230 235 240 

Leu lie Met Asn Ala Ala Gin Gly Leu Leu Ser Ser Leu Gly Ala Ser 

245 250 255 

Ser Leu Glu lie Glu Thr Leu Val Tyr Arg Met Arg Ser Ala Gly Ala 

260 265 270 

Leu Gly Ala Lys Leu Thr Gly Ala Gly Trp Gly Gly Cys Val lie Gly 
275 280 285 

Leu Phe Lys Glu Gly Glu val Glu Arg Gly Leu Glu Ser val val Glu 
290 295 300 

Ser ser Ser Gin Ala Phe Thr Ala Ser lie Ala Glu Glu Gly Ala Arg 
305 310 315 320 

Leu Glu Glu Phe 



<210> 15 
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<211> 387 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> His6-Paracoccus zeaxanthinifaciens 
<400> 15 

Met Arg Gly Ser His His His His His His Ser Thr Gly Arg Pro Glu 
15 10 15 

Ala Gly Ala His Ala Pro Gly Lys Leu lie Leu Ser Gly Glu His Ser 

20 25 30 

Val Leu Tyr Gly Ala Pro Ala Leu Ala Met Ala lie Ala Arg Tyr Thr 
35 40 45 

Glu val Trp Phe Thr Pro Leu Gly lie Gly Glu Gly lie Arg Thr Thr 
50 55 60 

Phe Ala Asn Leu Ser Gly Gly Ala Thr Tyr Ser Leu Lys Leu Leu Ser 
65 70 75 80 

Gly Phe Lys Ser Arg Leu Asp Arg Arg Phe Glu Gin Phe Leu Asn Gly 

85 90 95 

Asp Leu Lys val His Lys val Leu Thr His Pro Asp Asp Leu Ala val 

100 105 110 

Tyr Ala Leu Ala Ser Leu Leu His Asp Lys Pro Pro Gly Thr Ala Ala 
115 120 125 

Met Pro Gly lie Gly Ala Met His His Leu Pro Arg Pro Gly Glu Leu 
130 135 140 

Gly Ser Arg Thr Glu Leu Pro lie Gly Ala Gly Met Gly Ser Ser Ala 
145 150 155 160 

Ala lie val Ala Ala Thr Thr val Leu Phe Glu Thr Leu Leu Asp Arg 

165 170 175 

Pro Lys Thr Pro Glu Gin Arg Phe Asp Arg val Arg Phe Cys Glu Arg 

180 185 190 

Leu Lys His Gly Lys Ala Gly Pro lie Asp Ala Ala Ser val val Arg 
195 200 205 

Gly Gly Leu val Arg val Gly Gly Asn Gly Pro Gly Ser lie Ser Ser 
210 215 220 
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Phe Asp Leu Pro Glu Asp His Asp Leu Val Ala Gly Arg Gly Trp Tyr 
225 230 235 240 

Trp val Leu His Gly Arg Pro Val Ser Gly Thr Gly Glu Cys val Ser 

245 250 255 

Ala Val Ala Ala Ala His Gly Arg Asp Ala Ala Leu Trp Asp Ala Phe 

260 265 270 

Ala val Cys Thr Arg Ala Leu Glu Ala Ala Leu Leu Ser Gly Gly Ser 
275 280 285 

Pro Asp Ala Ala lie Thr Glu Asn Gin Arg Leu Leu Glu Arg lie Gly 
290 295 300 

Val Val Pro Ala Ala Thr Gin Ala Leu val Ala Gin lie Glu Glu Ala 
305 310 315 320 

Gly Gly Ala Ala Lys lie Cys Gly Ala Gly Ser val Arg Gly Asp His 

325 330 335 

Gly Gly Ala val Leu val Arg lie Asp Asp Ala Gin Ala Met Ala Ser 

340 345 350 

val Met Ala Arg His Pro Asp Leu Asp Trp Ala Pro Leu Arg Met Ser 
355 360 365 

Arg Thr Gly Ala Ala Pro Gly Pro Ala Pro Arg Ala Gin Pro Leu Pro 
370 375 380 

Gly Gin Gly 
385 

<210> 16 
<211> 1137 
<212> DNA 

<213> Paracoccus zeaxanthinifaciens 
<400> 16 

atgtcgaccg gcaggcctga agcaggcgcc catgccccgg gcaagctgat cctgtccggg 60 

gaacattccg tgctctatgg tgcgcccgcg cttgccatgg ccatcgcccg ctataccgag 120 

gtgtggttca cgccgcttgg cattggcgag gggatacgca cgacattcgc caatctctcg 180 

ggcggggcga cctattcgct gaagctgctg tcggggttca agtcgcggct ggaccgccgg 240 

ttcgagcagt tcctgaacgg cgacctaaag gtgcacaagg tcctgaccca tcccgacgat 300 

ctggcggtct atgcgctggc gtcgcttctg cacgacaagc cgccggggac cgccgcgatg 360 
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ccgggcatcg 


gcgcgatgca 


ccacctgccg 


cgaccgggtg 


agctgggcag 


ccggacggag 


420 


ctgcccatcg 


gcgcgggcat 


ggggtcgtct 


gcggccatcg 


tcgcggccac 


cacggtcctg 


480 


ttcgagacgc 


tgctggaccg 


gcccaagacg 


cccgaacagc 


gcttcgaccg 


cgtccgcttc 


540 


tgcgagcggt 


tgaagcacgg 


caaggccggt 


cccatcgacg 


cggccagcgt 


cgtgcgcggc 


600 


gggcttgtcc 


gcgtgggcgg 


gaacgggccg 


ggttcgatca 


gcagcttcga 


tttgcccgag 


660 


gatcacgacc 


ttgtcgcggg 


acgcggctgg 


tactgggtac 


tgcacgggcg 


ccccgtcagc 


720 


gggaccggcg 


aatgcgtcag 


cgcggtcgcg 


gcggcgcatg 


gtcgcgatgc 


ggcgctgtgg 


780 


gacgccttcg 


cagtctgcac 


ccgcgcgttg 


gaggccgcgc 


tgctgtctgg 


gggcagcccc 


840 


yaLyLLL)LLd 


LLaLLyayad 


LLaijLyLU Ly 




LL.yyL9LC.yL 


yLcgycdycy 




acgcaggccc 


tcgtggccca 


gatcgaggag 


gcgggtggcg 


cggccaagat 


ctgcggcgca 


960 


ggttccgtgc 


ggggcgatca 


cggcggggcg 


gtcctcgtgc 


ggattgacga 


cgcgcaggcg 


1020 


atggcttcgg 


tcatggcgcg 


ccatcccgac 


ctcgactggg 


cgcccctgcg 


catgtcgcgc 


1080 


acgggggcgg 


cacccggccc 


cgcgccgcgt 


gcgcaaccgc 


tgccggggca 


gggctga 


1137 


<210> 17 

<211> 1191 

<212> DNA 

<213> homo sapiens 












<400> 17 
atgttgtcag 


aagtcctact 


ggtgtctgct 


ccggggaaag 


tcatccttca 


tggagaacat 


60 


gccgtggtac 


atggcaaggt 


agcactggct 


gtatccttga 


acttgagaac 


attcctccgg 


120 


cttcaacccc 


acagcaatgg 


gaaagtggac 


ctcagcttac 


ccaacattgg 


tatcaagcgg 


180 


gcctgggatg 


tggccaggct 


tcagtcactg 


gacacaagct 


ttctggagca 


aggtgatgtc 


240 


acaacaccca 


cctcagagca 


agtggagaag 


ctaaaggagg 


ttgcaggctt 


gcctgacgac 


300 


tgtgctgtca 


ccgagcgcct 


ggctgtgctg 


gcctttcttt 


acttatacct 


gtccatctgc 


360 


cggaagcaga 


gggccctgcc 


gagcctggat 


atcgtagtgt 


ggtcggagct 


gccccccggg 


420 


gcgggcttgg 


gctccagcgc 


cgcctactcg 


gtgtgtctgg 


cagcagccct 


cctgactgtg 


480 


tgcgaggaga 


tcccaaaccc 


gctgaaggac 


ggggattgcg 


tcaacaggtg 


gaccaaggag 


540 


gatttggagc 


taattaacaa 


gtgggccttc 


caaggggaga 


gaatgattca 


cgggaacccc 


600 


tccggagtgg 


acaatgctgt 


caacacctaa 


aaaaaaaccc 


tccaatacca 


tcaaaaaaaa 


660 


atttcatcct 


taaagaggtc 


gccagctctc 


cagatcctgc 


tgaccaacac 


caaagtccct 


720 


cgcaatacca 


gggcccttgt 


ggctggcgtc 


agaaacaggc 


tgctcaagtt 


cccagagatc 


780 


gtggcccccc 


tcctgacctc 


aatagatgcc 


atctccctgg 


agtgtgagcg 


cgtgctggga 


840 


gagatggggg 


aagccccagc 


cccggagcag 


tacctcgtgc 
Page 


tggaagagct 
25 


cattgacatg 


900 









21788wo.ST25.txt 






aaccagcacc 


atctgaatgc 


cctcggcgtg 


ggccacgcct 


ctctggacca 


gctctgccag 


960 


gtgaccaggg 


cccgcggact 


tcacaqcaaq 


ctgactggcg 


caggcggtgg 


tqqctqtqqc 


1020 


atcacactcc 


tcaagccagg 


gctggagcag 


ccagaagtgg 


aggccacgaa 


gcaggccctg 


1080 


accagctgtg gctttgactg 


cttggaaacc 


a n r a 1* r fin t* n 

ay v. a. l v_yy w y 


c recrement 


ctccatccac 


1140 


tcagccacct 


ccctggacag 


ccgagtccag 


caagccctgg 


atggcctctg 


a 


1191 


<210> 18 
<211> 1188 
<212> DNA 
<213> mouse 












<400> 18 
atqttqtcaq 


aaqccctqct 


ggtgtccgcc 


ccqqqqaaqq 


tcatcctcca 


tggagaacac 


60 


actataqtcc 

3 w jj 33 »7 www 


atqqcaaqqt 


cgctctggca 


qcqqccttqa 

33 33 33 ^ — 3 


acttqaqaac 


tttcctcctg 


120 


ctqcqaccqc 


aqaqcaatqq 


gaaagtgagc 


qtcaatttac 


ccaacatcqq 


tattaagcag 


180 


qtqtqqqatq 


tqqqcatqct 


tcagcgactg 


qacacqaqct 


ttcttqaqca 


aggtgatgtc 


240 


tcqqtaccca 


ccttqqaqca 


actggagaag 


ctaaaqaaaa 


toqqqqacct 


ccccagagac 


300 


catacaaqca 


ataaaaacat 


ggctctgctt 


qcctttctct 

*J www w w w» w 


acctatacct 

V* V* W W ^4 W W\ \» W 


ggcaatctgc 


y rt 

360 


caaaaacaaa 


aaacactccc 

Wf W\ X* \A X* W W* X* W« 


gagcctggac 


atoatqatat 


aatcaaaact 


tccccccggg 


420 


gcaggcttgg 


gctccagcgc 


cgcctactct 


qtatatctoo 

33 w 33 w 33 wv "^33 33 


caqccaccct 


cctgactgcc 


^ rt rt 
480 


tgtgaggagg 


tctccaaccc 


gctcaaggac 


aqaatctcca 

33 3357 ;J *•>" w W33 


tcaacaaata 


geccgaggaa 


540 


gatctgaagt 


caatcaacaa 


gtgggccttc 


qaaqqqqaqa 

33 33 33 JJ **3 ** 


qaataatcca 


tgggaaccct 


rt rt 

600 


tctggtgtgg 


acaatgccgt 


cagcacctgg 


qqcqqaqccc 

33 33 37 w; 


tqcqcttcca 


gcaagggacg 


^* /~ rt 
660 


atgtcttcct 


tgaagagcct 


cccgtctctg 


caqatcctqc 


tcaccaacac 


caaggtcccg 


^ •> rt 
720 


cggagtacca 


aggcccttgt 


ggctgctgtc 


aqaaqcaqqc 

*■* 33 33 ***33 33 w 


tqaccaaatt 


ccctgagatt 


O rt 

780 


gtggccccgc 


tgctgacctc 


cattgacgca 


atatccctqq 


aotataaaca 


cgtgctaggg 


rt A rt 

840 


gagatggtgg 


cagctccagt 


tccggaacag 


tacctcatac 


taaaaaaact 


gatagacatg 


rt rt rt 
900 


aaccagcacc 


atctgaatgc 


tctcggggtg 


ggccacaact 


ccctggacca 


gctctgccaa 


y\ y% 

960 


gtaacggcag 


cacacggact 


gcacagcaag 


ctgacgggcg 


ctggcggtgg 


cggctgtggc 


1020 


atcaccctcc 


tgaagccagg 


tctagagcaa 


gccacagtgg 


aggcagecaa 


gcaggccctg 


1080 


accagctgcg 


ggtttgactg 


ctgggagacc 


agcatcggcg 


cacccggagt 


ttccacacac 


1140 


tcagctgcag 


ctgtagggga 


ccctgtccga 


caagccctgg 


gectctga 




1188 



<210> 19 
<211> 1188 
<212> DNA 
<213> rat 
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<400> 19 
atgttgtcag 


aagtcctgct 


ggtgtctgct 


ccaqqqaaaq 


tcattctcca 


tggagaacat 


60 


gctgtggtcc 


atggcaaggt 


agctctggcg 


Qtqqccttqa 


acttqaqaac 


atttctcgtg 


120 


ctgcgaccgc 


agagcaatgg 


gaaagtgagc 


ctcaatttac 


caaacatcqq 


tattaagcag 


180 


gtctgggatg 


tggccacact 


tcagctgctg 


qacacaqqct 


ttcttaaaca 


aggcgatgtc 


240 


ccggcaccca 


ccttggagca 


actggagaag 


ctqaaqaaqq 


taacaaacct 

w jjJ7 «733 


cccccgagac 


300 


tgtgtaggca 


acgaaggcct 


gtctctgctt 


qcctttctqt 


acctqtacct 


ggctatctgc 


360 


cggaaacaga 


ggacactccc 


aagcctggac 


atcatqqtqt 


qqtcqqaact 


gccccctggg 


420 


gcgggcttgg 


gctccagtgc 


agcctactcg 


qtqtqtqtqq 

23 23 w 23 w 23 


caqccaccct 


cctgactgcc 


480 


tgtgaggagg 


tcaccaaccc 


gctcaaggac 


aqqqqctcca 


ttaacaatta 

» w 23 23 **23 w *■ 3 


gcccgaggag 


540 


gacctgaagt 


caattaacaa 


gtgggcctac 


qaqqqqqaqa 


qaqtqatcca 


tgggaacccc 


600 


tctggcgtgg 


acaattccgt 


cagcacctgg 


qqaqqaqccc 


tacactacca 


gcaagggaag 


660 


atgtcatcct 


tgaagaggct 


cccagctctg 


caqatcctac 


tcaccaacac 


caaggtccca 


720 


cgaagcacca 


aggccctcgt 


ggctggcgtc 


aaaaacaaac 


taatcaaatt 


ccctgagatc 


780 


atggccccgc 


tcctgacatc 


aattgacgca 


atctccctaa 


aatataaaca 


cgtgctggga 


840 


gagatggcgg 


ccgcaccagt 


cccagaacag 


taccttatcc 


taaaaaaact 


aatggacatg 


900 


aaccagcacc 


atctgaatgc 


ccttggtgtg 


ggccacgcct 


ccctggacca 


gctctgtcag 


960 


ataacaacaa 


cacataaact 




ctgactggcg 


caggcggcgg 


c n n c 1" n 1" n n r* 


1070 


atcaccctcc 


tgaagccagg 


tctagagcga 


gcaaaagtgg 


aggccgccaa 


gcaggccctg 


1080 


accggctgcg 


ggtttgactg 


ctgggagacc 


ay (.a l iyydy 


cyc(_ Lyyyy l 


ctccatgcac 


1140 


tcagccacct 


ccatagagga 


ccctgtccga 


caagccctgg 


gcctctga 


■ 


1188 


<210> 20 
<211> 1137 

<212> DNA 

<213> Arabidopsis thai i ana 










<400> 20 

atggaagtga aagctagagc tcctgggaag 


atcatactta 


caaaaaaaca 

3333 


cgctgttgtt 


60 


catggatcca 


ccgctgtagc 


tgccgccatt 


aatctctaca 


cttacattac 


tctccgcttt 


120 


cctcttccat 


cagctgagaa 


caatgatagg 


cttacacttc 


agctcaagga 


catttccttg 


180 


gagttttcat 


ggtccttagc 


cagaatcaaa 


gaagcgattc 


cttatgattc 


aagcactctc 


240 


tgccgttcta 


cgccggcttc 


atgttcagag 


gagaccctta 


aatcaattgc 


agttttggtt 


300 


gaagagcaaa 


atcttccaaa 


ggaaaagatg 


tggctctcct 


ctgggatctc 


cacgtttctc 


360 


tggttataca 


ccagaattat 


agggttcaat 


ccggctacag 
Page 


tagtcattaa 
27 


ctctgagctt 


420 



ccatacgggt 


ctggcctcgg 


ttcatcagca 
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gctttatgtg tagctctcac 


agctgctctc 


480 


cttgcttctt 


ctatttcaga 


gaaaacccgt 


ggtaacggtt 


ggtcatctct 


cgatgaaacc 


540 


aatcttgagt 


tgctaaataa 


atgggctttc 


gaaggcgaaa 


agatcatcca 


tgggaaacct 


600 


tctgggatag 


acaacaccgt 


cagtgeatae 


ggcaacatga 


tcaagttctg 


etcaggegag 


660 


ataactcggt 


tacaatccaa 


catgcctctg 


agaatgetaa 


ttaccaacac 


tagagttggg 


720 


cgaaacacaa 


aagctctggt 


ctctggtgtg 


tcacagagag 


eggtaagaca 


tectgatgeg 


780 


atgaagtcag 


tgttcaacgc 


cgtggattct 


ataagcaaag 


agctcgctgc 


gatcattcag 


840 


tctaaagacg 


agacctcagt 


tacagaaaaa 


gaagagagaa 


taaaagaact 


catggagatg 


900 




t detect ate 


aataaaaatt 

"O Ly y yyy l l 


agccacagct 


caatcgaggc 


Ly LyQ L LL. LCI 


960 


accacggtca 


agcacaagct 


tgtctccaaa 


cttacaggag 


ctggtggcgg 


cggctgcgtc 


1020 


ctcactctat 


taccaaccgg 


gacggtggtg 


gacaaagtgg 


tggaggagct 


cgagtccagc 


1080 


ggttttcagt 


gtttcaegge 


attgattggt 




rtranatttn 


ctattga 


1137 


<210> 21 
<211> 1332 

<212> DNA 

<213> Saccharomyces cerevisiae 










<400> 21 
atgtcattac 


cgttcttaac 


ttctgcaccg 


ggaaaggtta 


ttatttttgg 


tgaacactct 


60 


gctgtgtaca 


acaagcctgc 


cgtcgctgct 


agtgtgtctg 


cgttgagaac 


ctacctgcta 


120 


ataagcgagt 


catctgcacc 


agatactatt 


gaattggact 


tcccggacat 


tagctttaat 


180 


cataagtggt 


ccatcaatga 


tttcaatgee 


atcaccgagg 


atcaagtaaa 


ctcccaaaaa 


240 


ttggccaagg 


ctcaacaagc 


caccgatggc 


ttgtctcagg 


aactegttag 


tcttttggat 


300 


ccgttgttag 


ctcaactatc 


cgaatccttc 


cactaccatg 


cagcgttttg 


tttcctgtat 


360 


atgtttgttt 


gcctatgccc 


ccatgccaag 


aatattaagt 


tttctttaaa 


gtctacttta 


420 


cccatcggtg 


ctgggttggg 


ctcaagcgcc 


tctatttctg 


tatcactggc 


cttagctatg 


480 


gcctacttgg 


gggggttaat 


aggatctaat 


gacttggaaa 


agctgtcaga 


aaacgataag 


540 


catatagtga 


atcaatgggc 


cttcataggt 


gaaaagtgta 


ttcaeggtae 


cccttcagga 


600 


atagataacg 


ctgtggccac 


ttatggtaat 


gccctgctat 


ttgaaaaaga 


ctcacataat 


660 


ggaacaataa 


acacaaacaa 


ttttaagttc 


ttagatgatt 


tcccagccat 


tccaatgatc 


720 


ctaacctata 


ctagaattcc 


aaggtctaca 


aaagatcttg 


ttgctcgcgt 


tcgtgtgttg 


780 


gtcaccgaga 


aatttcctga 


agttatgaag 


ccaattctag 


atgccatggg 


tgaatgtgcc 


840 


ctacaaggct 


tagagatcat 


gactaagtta 


agtaaatgta 


aaggcaccga 


tgacgaggct 


900 


gtagaaacta 


ataatgaact 


gtatgaacaa 


ctattggaat 
Page 


tgataagaat 
28 


aaatcatgga 


960 



ctgcttgtct 


caatcggtgt 


ttctcatcct 
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ggattagaac ttattaaaaa 


tctgagcgat 


1020 


gatttgagaa 


ttggctccac 


aaaacttacc 


ggtgctggtg 


gcggcggttg 


ctctttgact 


1080 


ttgttacgaa gagacattac tcaagagcaa 


attgacagct 


tcaaaaagaa 


attgcaagat 


1140 


gattttagtt acgagacatt tgaaacagac 


ttgggtggga 


ctggctgctg 


i l Ly l LdayL 




gcaaaaaatt 


tgaataaaga 


tcttaaaatc 


aaatccctag 


tattccaatt 


atttgaaaat 


1260 


aaaactacca 


caaagcaaca 


aattgacgat 


ctattattgc 


caggaaacac 


gaatttacca 


1320 


tggacttcat 


aa 










1332 


<210> 22 
<211> 1215 
<212> DNA 

<213> Schizosaccharomyces pombe 










<400> 22 
atgtcaaaat 


cgcttattgt 


ttcgtcgcca 


ggaaaaacga 


ttttgtttgg 


ggaacatgcc 


60 


gttgtatatg 


gagctactgc 


gttagctgca 


gctgtatcgt 


tacggagtta 


ctgtaaatta 


120 


cagacgacta 


ataacaatga 


aatagtaatt 


gtgatgagtg 


atatagggac 


cgaacgccga 


180 


tggaatcttc 


aatcgctacc 


ttggcagcat 


gtaacagtgg 


aaaacgttca 


gcacccggca 


240 


tcatctccca 


atctggacct 


tttacaagga 


ttaggagagc 


tattaaaaaa 


tgaagaaaac 


300 


ggacttattc 


actcagcaat 


gctttgtacc 


ctttacttgt 


tcacgtcttt 


gtcttctcct 


360 


tctcagggtt 


gtactttaac 


tattagctcc 


caagtacctt 


tgggtgctgg 


attaggtagt 


420 


agtgctacta 


tatcagttgt 


tgtcgctaca 


agtttactac 


tagcttttgg 


taatattgaa 


480 


cctcctagct 


caaattctct 


tcaaaacaac 


aaagcacttg 


cgttgataga 


ggcttggtct 


540 


tttctaggtg 


aatgctgtat 


tcatggaaca 


ccaagtggta 


ttgataatgc 


agtagcaaca 


600 


aatggaggac 


ttatcgcttt 


tcgtaaagct 


acagctcatc 


agagtgccat 


gaaagaattc 


660 


ttaaagccta aagatacctt 


atctgttatg 


atcactgata 


ccaaacaacc 


aaaaagtact 


720 


aaaaaacttg 


tacaaggagt 


ttttgaactg 


aaggaaagac 


taccaactgt 


gattgactca 


780 


ataatagatg 


caatcgatgg 


catatcaaag 


tctgccgtcc 


tcgcattgac 


ttcggagagc 


840 


gataaaaact 


cctccgctaa 


aaagttagga 


gagtttattg 


ttcttaatca 


aaaactctta 


900 


gaatgcttgg 


gtgtatccca 


ttattccatt 


gatcgcgttt 


tacaagccac 


taagtcaatt 


960 


ggatggacga agcttacagg 


tgccggtggt 


ggaggttgta 


cgattacttt 


attaacacct 


1020 


gagtgcaaag 


aagaggaatt 


taagttatgt 


aaagaatcac 


tattagccca 


taaaaattct 


1080 


atttacgatg 


ttcaattagg 


tggacctggt 


gtttcagtgg 


taaccgactc 


agattcattt 


1140 


ttccctcaat 


atgagtctga 


ctttgatttt 


aaaaaattga 


atttactcag 


caaatttaat 


1200 


aaatattata 


tttaa 
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<210> 23 
<211> 1008 
<212> DNA 

<213> Pyrococcus abyssi 



<400> 23 
atgccaaggt 


tagtgctggc 


gtcagctcca 


gcaaagataa 


tactcttcgg 


ggaacacagc 


60 


gttgtgtatg 


gaaagcctgc 


catagcatct 


gctattgact 


tgagaactta 


cgttagggcg 


120 


gagtttaatg 


attcgggaaa 


tataaagata 


gaagcccatg 


acataaaaac 


ccctgggcta 


180 


atagtttctt 


tttcagaaga 


caaaatttac 


ttcgagactg 


actatggaaa ggcagctgaa 


240 


gtgctgagtt 


acgttagaca 


cgccatagag 


ctcgtccttg 


aagaggctga taagaggact 


300 


ggggtcagcg 


tttcaataac 


gtctcaaatt 


ccagtaggtg 


ctggcctagg 


ttcttcagct 


360 


gccgtcgccg 


ttgctaccat 


cggtgccgtc 


tccaagttac 


ttgacctcga 


gcttagtaaa 


420 


gaggagatag 


ctaagatggg 


ccataaggtt 


gaactcctgg 


ttcagggagc 


ttcgagtggc 


480 


atagatccga 


cggtctcggc 


aataggaggg 


ttcttgtact 


ataagcaagg 


tgaatttgag 


540 


cacctaccat 


tcgtggagct 


tccaatagta 


gttggatata 


ccggctcaag 


tggctccaca 


600 


aaggaattag 


ttgcgatggt 


taggagaagg 


tacgaggaga 


tgcccgagtt 


aattgaaccc 


660 


attctagagt 


caatgggtaa 


gctcgtggat 


aaagctaagg 


aggtaataat 


atctaagctc 


720 


gatgaggagg 


aaaagttcct 


gaaattggga 


gagctcatga 


acataaatca 


tggccttctc 


780 


gatgccctag 


gtgtttcaac 


caaaaagcta 


agcgaactcg 


tctatgccgc tagaactgct 


840 


ggagcaattg 


gagccaagct 


aacgggggct 


gggggaggtg 


gatgcatgta 


cgctttagct 


900 


cctgggaagc 


agagggaggt 


tgctacggcc 


ataaagatag 


ctggcggaac 


tcccatgata 


960 


acgaggataa 


gcaaggaggg 


gcttagaata 


gaggaggtaa 


gggaatga 




1008 


<210> 24 
<211> 1008 
<212> DNA 

<213> Pyrococcus horikoshii 










<400> 24 
tcatcttgaa 


acctcctcta 


ttctcaatcc 


ttccctactt 


accctcgtta tcattggaat 


60 


tccaccagct 


atctttattg 


ccgtggcgac 


ttccctctgt 


ctcccagggg 


ccaaggcata 


120 


catacaccct 


ccaccaccag 


cacccgtaag 


ttttgcccca 


attgcaccag 


ccgttctagc 


180 


agcgtaaact 


aactcaccaa gcttcttcgt 


agacaccccc 


aaagcatcta 


aaagcccatg 


240 


atttatgttc 


attaactctc 


caagcttagt 


aagcttctcc 


tcttcatcga gctttgaaag 


300 


tattatctcc 


ttggccttat 


ccactaattt 


tcccattgcc 


tccaatatag 


gctccacaag 


360 


ttcgggcatt 


tcctcgtacc 


ttttccttac 


cattgccact 


aattctttag 


ttgaaccagt 


420 


agagccggtg 


taaccaacga 


ctatgggaag 


ctccatgaat 
Page 


gggagaggct 
30 


caaactttcc 


480 
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ttgcttataa 


tagaggaagc 


ctcccactgc 


agaaactgta ggatcaatgc 


cacttgaagc 


540 


tccctgcact 


agaagttcaa ctttatgccc 


aagctttgct atttcctcct 


tactcaattc 


600 


aaggccaagt 


aacctagaga ccgcaccaat 


tgtagcaact gcaaccgctg 


ctgaggaacc 


660 


caatccggcc 


ccaactggaa 


tttgagaggt 


tattgaaacg tcaataccaa 


ccctcttatc 


720 


oy c*v. v_ v_ \w ill 


agggcaagtt 


ctatggcata 






780 


tccatagtct 


gtttcaaagt 


atatcttatc 


ctcggaaaac gagactatca 


atccaggagt 


840 


ttttatatca 


tgagcttcta 


tttttatatt 


acccgaatca ttgaattgag 


ctctaacata 


900 


tgttctaagt 


tctatagcag 


aagctatagc 


cggttttcca tagactacgc 


tatgttcccc 


960 


gaagagtatt 


acctttgcag 


gagctgaagc 


taaaacgtac ttaaccat 

• 




1008 


<210> 25 
<211> 1005 

<212> DNA 

<213> Pyrococcus furiosus 








<400> 25 
tcatttaatt 


acctcctcaa 


tttttaaccc 


ttctctgctg atttcagtta 


tcataggagt 


60 


tccccctgca 


attcttatag 


ctgtagcaac 


ttctctctgc ttgttcggtg 


ccaaggcata 


120 


catacaacct 


cctcccccag 


ccccagttat 


cttagctcct agggctcccg 


caaccctagc 


180 


cgcgtacact 


agttcactca 


actttttagt 


tgaaacaccc aaagcatcta 


aaagaccgtg 


240 


attaatgttc 


atcaaaaccc 


caagcctttc 


aaatttttcc tccttatcaa 


catttgaaag 


300 


tattacatcc 


ttggctttct 


cgacaacttt 


tcccatagcc tctaatatgg 


gaacaatcaa 


360 


ctcgggcatc 


tcctcatatc 


ttttcctaac 


cattgcaact aactccttag 


ttggacctga 


420 


ggagcccgtg 


tatccaacaa 


ctattggaag 


ctccatgaag ggaaggtgtt 


caaatttacc 


480 


cttttcatag 


aatataaacc 


cccctattgc 


agagaccgtt gggtctatac 


cacttgacgc 


540 


accttgaaca 


agtaattcag 


tcttgtgacc 


catcttagct atttcctcct 


tacttagctc 


600 


aagtcctagt 


aactttgata 


cagccccaat 


tgtggctact gccacggcag 


cagaagagcc 


660 


caatccagca 


ccaactggaa 


tttgagaagt 


tatgctaacc ttaataccaa 


catttttctt 


720 


atctgcctcc 


tctaaaacta 


aatttattgc 


ctctctaaca tagctcaaaa 


cttcagctgc 


780 


ttttccataa 


tctgtctcga 


aatatatctc 


attttcagaa aatgaaaccg 


taagtccagg 


840 


aactttaatg 


tcatgagctt 


caattcttat 


ttttttctcc cgaattagct 


cagcctccac 


900 


aaaagttcgt 


aaatcaatgg 


cagcagctat 


cgctggcttt ccgtaaacta 


cgctatgctc 


960 


tccaaaaaga 


ataacttttg 


cgggagctga 


ggctataact ttcat 




1005 



<210> 26 
<211> 912 
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<212> DNA 

<213> Methanobacterium thermoautotrophicum 



<400> 26 
ttgaagtcgt 


cggcatccgc 


acctgccaag 


gccattcttt ttggtgaaca 


cgcagtggtc 


60 


tacagcaagc 


cggcaatagc 


agccgccata 


gaccgcaggg tgactgtaac 


cgtaagtgaa 


•1 *\ r\ 

120 


tccagcagca 


cccatgtaac 


catcccctcc 


ctgggtatac gccacagttc agagagacca 


180 


tccggtggca 


tcctggacta 


catcgggagg 


tgcctcgagc tttaccatga 


cgcatcaccc 


240 


cttgacatca 


gggtggagat 


ggagataccc 


gccggttcag gcctaggttc atcggctgca 


300 


ctcaccgttg 


cactgatagg 


tgccctcgac 


aggtaccatg gaagggatca tggacccggg 


360 


gagacagcag 


ccagggccca 


cagggtggag 


gttgatgtac agggagccgc 


cagccccctt 


420 


gacacagcca 


tcagcaccta 


tgggggcctt 


gtataccttg acagccagag 


gagggtgagg 


480 


cagtttgagg 


ccgacctggg 


ggaccttgta 


dLciyLaLdLL LLydLlatLL 


aggggaaaca 


540 


gccaggatgg 


ttgccggcgt 


agctgaaagg 


■f - "f - f ^ rt rt ^ rt "3 +* "t" f f r* rt rt 3 "t* ra +■ 

L LLdygdgd u xcccggaiaT. 


r'^'4'rtrtrtrt^rtrt 

caxggggagg 


600 


ataataaaca 




cataaccaat 


3 r o rt r* 3 + a /*• o rt rt rt ^ r" *f- +■ r~ +■ 
dCdgCdtdCd gggddCLLCL 


aaggaacaac 


660 

W WW 


acagaacccc 


tgggggagct 


catgaacctc 


^ ^ t~ r"^ rt rt rt rt r~ +* rt r* +■ rt rt ■a r* +■ r* 

aaccaggggc LgcrggacLC 


r* rtrtrtr* /i +■ +■ 

cargggcgx l 


720 


tccacacgtg 


aactttcaat 


gatggtctat 


rt ^ rt rt /*• ^ ^ rt rt ^ ^ r" rt r* r* rt rt rt rt r" 

gaggcaagga acgccggggc 


^rtr*^rtrt + ^r*'^ 

agcagg l tea 


780 


aagatcacag 


gagccggcgg 


cggcgggagc 


ataatagccc actgcccggg 


atgtgtggat 


840 


gatgttgtca 


cggcccttaa 


caggaactgg 


aaagccatga gggcagagtt 


ttcggttaag 


900 


ggactcatct 


aa 








912 


<210> 27 
<211> 855 
<212> DNA 

<213> Archaeoglobus fulgidus 








<400> 27 
tcactcttca 


attctgacac 


cctccttttc 


gggctcgact atgaagctcc 


ctttcggctt 


60 


ctctcccttg 


aacaggccga 


atatacaccc 


cccaccaccc gctccagtta 


ttttcgcatt 


120 


caaacccatc 


ctttcgagct 


ctgcaatggt 


tctgtcgatt tcgggattgc 


tcacccctat 


180 


cgccctcaga 


agcgactggt 


ttatggctat 


gagctcctca agtctttcag 


cgctgccaac 


240 


atcgctcgcc 


tcaagggaga 


ttgcgtcgat 


agcgtcaaat attttatcca 


caacctcagg 


300 


atgcctctct 


ctcagctcag 


caacttttgc 


gaccatctca gccgttgacc tgctgccaaa 


360 


gttgataacg 


aagaacttga 


acggcatctc 


gacctttcta cgctcaggga 


aaagccacga 


420 


acctccaaag 


gtggatatga 


acgggtctat 


tccgctcgcc cttccctgaa 


cgtcaatctc 


480 


aacctgcttc 


gccatctgga 


agatggcctc 


tttatccata tccccatcga actctgcatt 


540 


cagagcagct 


attgtggcaa 


caatcaccgc 


cgcagagctg cccagcccag 
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agccaatcgg 


600 



) , 



tatttcgctc 


tcaatctcaa 


tttcagcacc 
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cggaatattt ctaagctccc 


caaacctctt 


4**. 

660 


aacagcctgc acgacgtagg gatgcctttg 


atagtccagc ccgctctctc 


ctaacgaaga 


720 


cctaatcaga aacctgtcag 


attttctcac 


cgacactctg catctaaggt 


taatcgctga 


780 


caccaccgca tgcctaccgt aaaccaccgc 


atgctcgccg aacagaatta tctttccggg 


840 


tgctgatgca 


atcat 








855 


<210> 28 
<211> 939 

<212> DNA 

<213> Methanococcus jannaschii 








<400> 28 
atgataattg 


aaacaccatc 


aaaagttata 


ctattcggag agcatgcagt 


tgtttatggt 


60 


tatagagcta 


tatctatggc 


tattgattta 


acatcaacca tagaaataaa 


agaaacacaa 


120 


gaagatgaga 


taattttaaa 


cctaaatgac 


ttgaataaaa gcttaggttt 


gaacttaaat 


180 


gagataaaaa 


atatcaatcc 


aaataacttt 


ggagatttta aatactgcct 


ctgtgcaatt 


240 


aaaaacactt 


tagattattt 


aaatatagag 


ccaaaaactg gttttaaaat 


taacattagc 


300 


tcaaaaattc 


caataagttg 


tggtttggga 


agctctgcct caataacaat 


tggaactata 


360 


aaagctgtaa 


gtggatttta 


taataaagag 


cttaaagatg atgagattgc 


aaaacttgga 


420 


tatatggttg 


agaaagaaat 


ccaaggtaag 


gcaagcatta cagacacttc 


gacaataacg 


480 


tataaaggta 


tcttagaaat 


aaaaaacaac 


aagtttagaa aaattaaagg 


agagtttgaa 


540 


gaatttttaa 


aaaattgcaa 


gtttttaatt 


gtttatgctg aaaaaaggaa 


gaaaaaaact 


600 


gctgagttag 


ttaatgaagt 


tgccaagatt 


gaaaataaag atgagatatt 


taaagagata 


660 


gacaaagtta 


ttgatgaagc 


tttaaaaatc 


aaaaataaag aagattttgg 


gaaattgatg 


720 


actaaaaacc 


acgagttgtt 


aaaaaagcta 


ddLdLLLLdd LdLCddddCl 


Lga Ldgaa l l 


780 


gtagatattg 


ggaatagatt 


tggttttggg 


LjLddddi idd ciygaycLyg 


a ggggg a gga 


840 


tgtgtaataa 


tcttagttaa 


tgaagaaaaa 


y dy dddLJ dye tlllaaadga 


dCLdddlddd 


900 


gaagatgtaa 


ggatttttaa 


ctgcagaatg 


atgaattaa 




939 


<210> 29 
<211> 975 

<212> DNA 

<213> Aeropyrum pernix 








<400> 29 
atgaggaggg 


ctgctagggc 


gtctgccccg 


gggaaagtta taategttgg 


agaacacttc 


60 


gtcgtcagag 


gctccctggc 


gatagtggcg 


gecataggea gaaggctccg 


cgtcaccgtg 


120 


agaagcgggg 


gcaaggggat 


tgtgcttgag 


ageagcatge taggccgcca 


cagcgccccg 


180 


ctaccaggcc 


agggtgcagc 


ggctaaggta 


agccccgtcc tegagcegta 
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catagcagtg 


240 



is 
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ttgagaagtc 


tggctgcaag 


gggctatagc 


gtagtgcccc 


atacaatatt 


ggtggagagc 


300 


ggcatacccc 


ctagggcagg 


tctcggtagc 


agcgccgcca 


gcatggtagc 


ctatgctcta 


360 


tcatactcgg 


ccatgcatgg 


tgaccccctc 


tcggctgagg 


acctctacag 


tgttgctatg 


420 


gagggcgaga 


agatagcgca 


tggtaagccg 


agcggtgttg 


acgtaaccat 


agccgttagg 


480 


gggggagtcc 


tggcttacag 


gaggggcgag 


aacccggtgg 


atataaggcc 


ggggcttaca 


540 


ggtgtcactc 


tgcttgttgc 


cgacacgggt 


gtcgagaggc 


gtactaggga 


tgttgtcgag 


600 


catgttctct 


ccattgcgga 


cgccttggga 


gaggcatcga cctacatata tagggcggca 


660 


gacttgatag 


cgagagaagc 


cctccatgcg 


atagaaaagg 


gagacgccga 


gaggctaggt 


720 


cttataatga 


atgcagccca 


gggccttctc 


tcatctcttg 


gggcgtcgtc 


actagaaata 


780 


gaaacactag 


tatatcggat 


gaggagtgcc 


ggggccctgg 


gtgcaaagct 


aacgggagct 


840 


ggatgggggg 


gctgtgtgat 


agggcttttc 


aaggagggtg 


aggtcgaacg 


ggggctagag 


900 


tctgtggtag 


agagttcaag 


ccaggctttc 

* 


accgcgtcaa tagcagagga 


gggtgctaga 


960 


ctcgaggagt 


tctag 










975 



<210> 30 
<211> 432 
<212> PRT 

<213> Phaffia rhodozyma ATCC 96594 
<400> 30 

Lys Glu Glu lie Leu val Ser Ala Pro Gly Lys val lie Leu Phe Gly 
15 10 15 

Glu His Ala val Gly His Gly Val Thr Gly lie Ala Ala Ser val Asp 

20 25 30 

Leu Arg cys Tyr Ala Leu Leu Ser Pro Thr Ala Thr Thr Thr Thr Ser 
35 40 45 

Ser Ser Leu Ser Ser Thr Asn lie Thr lie Ser Leu Thr Asp Leu Asn 
50 55 60 

Phe Thr Gin ser Trp Pro val Asp Ser Leu Pro Trp Ser Leu Ala Pro 
65 70 75 80 

Asp Trp Thr Glu Ala Ser lie Pro Glu Ser Leu Cys Pro Thr Leu Leu 

85 90 95 

Ala Glu lie Glu Arg lie Ala Gly Gin Gly Gly Asn Gly Gly Glu Arg 

100 105 110 
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Glu Lys val Ala Thr Met Ala Phe Leu Tyr Leu Leu val Leu Leu ser 
115 120 125 

Lys Gly Lys Pro Ser Glu Pro Phe Glu Leu Thr Ala Arg Ser Ala Leu 
130 135 140 

Pro Met Gly Ala Gly Leu Gly Ser Ser Ala Ala Leu Ser Thr Ser Leu 
145 150 155 160 

Ala Leu val Phe Leu Leu His Phe Ser His Leu Ser Pro Thr Thr Thr 

165 170 175 

Gly Arg Glu Ser Thr lie Pro Thr Ala Asp Thr Glu val lie Asp Lys 

180 185 190 

Trp Ala Phe Leu Ala Glu Lys val lie His Gly Asn Pro Ser Gly lie 
195 200 205 

Asp Asn Ala Val Ser Thr Arg Gly Gly Ala Val Ala Phe Lys Arg Lys 
210 215 220 

lie Glu Gly Lys Gin Glu Gly Gly Met Glu Ala lie Lys Ser Phe Thr 
225 230 235 240 

Ser lie Arg Phe Leu lie Thr Asp Ser Arg lie Gly Arg Asp Thr Arg 

245 250 255 

Ser Leu val Ala Gly val Asn Ala Arg Leu lie Gin Glu Pro Glu val 

260 265 270 

lie Val Pro Leu Leu Glu Ala lie Gin Gin lie Ala Asp Glu Ala lie 
275 280 285 

Arg cys Leu Lys Asp Ser Glu Met Glu Arg Ala val Met lie Asp Arg 
290 295 300 

Leu Gin Asn Leu val Ser Glu Asn His Ala His Leu Ala Ala Leu Gly 
305 310 315 320 

val Ser His Pro Ser Leu Glu Glu lie lie Arg lie Gly Ala Asp Lys 

325 330 335 

Pro Phe Glu Leu Arg Thr Lys Leu Thr Gly Ala Gly Gly Gly Gly Cys 

340 345 350 

Ala val Thr Leu val Pro Asp Asp Phe Ser Thr Glu Thr Leu Gin Ala 
355 360 365 
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Leu Met Glu Thr Leu val Gin Ser Ser Phe Ala Pro Tyr lie Ala Arg 
370 375 380 

val Gly Gly Ser Gly val Gly Phe Leu Ser Ser Thr Lys Ala Asp Pro 
385 390 395 400 

Glu Asp Gly Glu Asn Arg Leu Lys Asp Gly Leu val Gly Thr Glu lie 



Asp Glu Leu Asp Arg Trp Ala Leu Lys Thr Gly Arg Trp Ser Phe Ala 

420 425 430 

<210> 31 
<211> 4135 
<212> DNA 

<213> Phaffia rhodozyma ATCC96594 
<400> 31 

actgactcgg ctaccggaaa atatcttttc aggacgcctt gatcgttttg gacaacacca 60 

tgatgtcacc atatcttcag cggccgttgg agctaggagt agacattgta tacgactctg 120 

gaacaaagta tttgagtgga caccacgatc tcatggctgg tgtgattact actcgtactg 180 

aggagattgg gaaggttcgt gcttgcttgc tttgaatgtc gtgcctaaag ccattgccat 240 

aagacagagt ctgatctatg tcgtttgcct acaacagaga atggcctggt tcccaaatgc 300 

tatgggaaat gcattgtctc cgttcgactc gttccttctt ctccgaggac tcaaaacact 360 

tcctctccga ctggacaagc agcaggcctc atctcacctg atcgcctcgt acttacacac 420 

cctcggcttt cttgttcact accccggtct gccttctgac cctgggtacg aacttcataa 480 

ctctcaggcg agtggtgcag gtgccgtcat gagctttgag accggagata tcgcgttgag 540 

tgaggccatc gtgggcggaa cccgagtttg gggaatcagt gtcagtttcg gagccgtgaa 600 

cagtttgatc agcatgcctt gtctaatgag gttagttctt atgccttctt ttcgcgcctt 660 

ctaaaatttc tggctgacta attgggtcgg tctttccgtt cttgcatttc agtcacgcat 720 

ctattcctgc tcaccttcga gccgagcgag gtctccccga acatctgatt cgactgtgtg 780 

tcggtattga ggaccctcac gatttgcttg atgatttgga ggcctctctt gtgaacgctg 840 

gcgcaatccg atcagtctct acctcagatt catcccgacc gctcactcct cctgcctctg 900 

attctgcctc ggacattcac tccaactggg ccgtcgaccg agccagacag ttcgagcgtg 960 

ttaggccttc taactcgaca gccggcgtcg aaggacagct tgccgaactc aatgtagacg 1020 

atgcagccag acttgcgggc gatgagagcc aaaaagaaga aattcttgtc agtgcaccgg 1080 

gaaaggtcat tctgttcggc gaacatgctg taggccatgg tgttgtgagt gagaaatgaa 1140 

agctttatgc tctcattgca tcttaacttt tcctcgcctt ttttgttctc ttcatcccgt 1200 
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cttgattgta 


gggatgcccc 


cctttgcccc 


tttccccttc 


ttgcatctgt 


ctatatttcc 


1260 


ttatacattt 


cgctcttaag 


agcgtctagt 


tgtaccttat 


aacaaccttt 


ggttttagca 


1320 


tcctttgatt 


attcatttct 


ctcatccttc 


ggtcagaggc 


tttcggccat 


ctttacgtct 


1380 


gattagattg 


taatagcaag 


aactatcttg 


ctaagccttt 


tctcttcctc 


ttcctcctat 


1440 


ataaatcgaa 


ttcactttcg 


gacatgttta 


ttttggggaa 


atcatcaagg 


ggtggggggc 


1500 


caatcccgac 


actaattttc 


tgctcacgtc 


aaaactcagc 


gttcagaatc 


agtcactgac 


1560 


cctgatacgt 


gtctctatgt 


gtgtgggtgt 


acgtgcgaat 


tgtgactcga 


cgttctacgc 


1620 


ttaaaaacag 


accgggatcg 


ctgcttccgt 


tgatcttcga 


tgctacgctc 


ttctctcacc 


1680 


cactgctacg 


acaacaacat 


catcgtcgtt 


atcgtctaca 


aacattacca 


tctccctaac 


1740 


ggacctgaac 


tttacgcagt 


cttggcctgt 


tgattctctt 


ccttggtcac 


ttgcgcctga 


1800 


ctggactgag 


gcgtctattc 


cagaatctct 


ctgcccgaca 


ttgctcgccg 


aaatcgaaag 


1860 


gatcgctggt 


caaggtggaa 


acggaggaga 


aagggagaag 


gtggcaacca 


tggcattctt 


1920 


gtatttgttg 


gtgctattga 


gcaaagggaa 


gccaaggtag 


gttttttctg 


tctcttcttt 


1980 


ttgcctataa 


agactcttaa 


ctgacggaga 


aagtgttggg 


tttcttcctt 


cgggggttca 


2040 


atcaattaaa 


gtgagccgtt 


cgagttgacg 


gctcgatctg 


cgcttccgat 


gggagctggt 


2100 


ctgggttcat 


ccgccgctct 


atcgacctct 


cttgccctag 


tctttcttct 


ccacttttct 


2160 


cacctcagtc 


caacgacgac 


tggcagagaa 


tcaacaatcc 


cgacggccga 


cacagaagta 


2220 


attgacaaat 


gggcgttctt 


agctgaaaaa 


gtcatccatg 


gaaatccgag 


tgggattgat 


2280 


aacgcggtca 


gtacgagagg 


aggcgctgtt 


gctttcaaaa 


gaaagattga 


gggaaaacag 


2340 


gaaggtggaa 


tggaagcgat 


caagaggtac 


gcagacacgg 


tgcttcatat 


gccatactcc 


2400 


agtctgattg 


acccatgatg 


aacgtctttc 


tacatttcga 


atatagcttc 


acatccattc 


2460 


gattcctcat 


cacagattct 


cgtatcggaa 


gggatacaag 


atctctcgtt 


gcaggagtga 


2520 


atgctcgact 


gattcaggag 


ccagaggtga 


tcgtcccttt 


gttggaagcg 


attcagcaga 


2580 


ttgccgatga 


ggctattcga 


tgcttgaaag 


attcagagat 


ggaacgtgct 


gtcatgatcg 


2640 


atcgacttca 


agttagttct 


tgttcctttc 


aagactcttt 


gtgacattgt 


gtcttatcca 


2700 


tttcatcttc 


ttttttcttc 


cttcttctgc 


agaacttggt 


ctccgagaac 


cacgcacacc 


2760 


tagcagcact 


tggcgtgtcc 


cacccatccc 


tcgaagagat 


tatccggatc 


ggtgctgata 


2820 


agcctttcga 


gcttcgaaca 


aagttgacag 


gcgccggtgg 


aggtggttgc 


gctgtaaccc 


2880 


tggtgcccga 


tggtaaagtc 


tctccttttc 


tcttccgtcc 


aagcgacaca 


tctgaccgat 


2940 


gcgcatcctg 


tacttttggt 


caaccagact 


tctcgactga 


aacccttcaa 


gctcttatgg 


3000 


agacgctcgt 


tcaatcatcg 


ttcgcccctt 


atattgcccg 


agtgggtggt 


tcaggcgtcg 


3060 


gattcctttc 


atcaactaag 


gccgatccgg 


aagatgggga 
Page 


gaacagactt 
37 


aaagatgggc 


3120 
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tggtgggaac 


ggagattgat 


gagctagaca 


gatgggcttt 


gaaaacgggt 


cgttggtctt 


fflBk 

3180 


ttgcttgaac 


gaaagatagg 


aaacggtgat 


tagggtacag 


atcctttgct 


gtcattttta 


3240 


caaaacactt 


tcttatgtct 


tcatgactca 


acgtatgccc 


tcatctctat 


ccatagacag 


3300 


cacggtacct 


ctcaggtttc 


aatacgtaag 


cgttcatcga 


caaaacatgc 


ggcacacgaa 


3360 


aacgagtgga 


tataagggag 


aagagagata 


ttagagcgaa 


aaagagaaga 


gtgagagagg 


3420 


aaaaaaataa 


ccgagaacaa 


cttattccgg 


tttgttagaa 


tcgaagatcg 


agaaatatga 


3480 


agtacatagt 


ataaagtaaa 


gaagagaggt 


ttacctcaga 


ggtgtgtacg 


aaggtgagga 


3540 


caggtaagag 


gaataattga 


ctatcgaaaa 


aagagaactc 


aacagaagca 


ctgggataaa 


3600 


gcctagaatg 


taagtctcat 


cggtccgcga 


tgaaagagaa 


attgaaggaa 


gaaaaagccc 


3660 


ccagtaaaca 


atccaaccaa 


cctcttggac 


gattgcgaaa 


cacacacacg 


cacgcggaca 


3720 


tatttcgtac 


acaaggacgg 


gacattcttt 


ttttatatcc 


gggtggggag 


agagagggtt 


3780 


atagaggatg 


aatagcaagg 


ttgatgtttt 


gtaaaaggtt 


gcagaaaaag 


gaaagtgaga 


3840 
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ctttccgggc 


gctttcttag 


accgcggtgg 


tgaagggtta 


ctcctgccaa 


ctagaagaag 


3960 


caacatgagt 


caaggattag 


atcatcacgt 


gtctcatttg 


acgggttgaa 


agatatattt 


4020 


agatactaac 


tgcttcccac 


gccgactgaa 


aagatgaatt. 


gaatcatgtc 


gagtggcaac 


4080 


gaacgaaaga 


acaaatagta 


agaatgaatt 


actagaaaag 


acagaatgac 


tagaa 


4135 


<210> 32 
<211> 1164 
<212> DNA 

<213> Artificial sequence 










<220> 

<223> Paracoccus zeaxanthinifaciens H7T 








<400> 32 
atgagaggat 


cgcatcacca 


tcaccatcac 


tcgaccggca 


ggcctgaagc 


aggcgcccat 


60 


gccccgggca 


agctgaccct 


gtccggggaa 


cattccgtgc 


tctatggtgc 


gcccgcgctt 


120 


gccatggcca 


tcgcccgcta taccgaggtg 


tggttcacgc 


cgcttggcat 


tggcgagggg 


180 


atacgcacga 


cattcgccaa tctctcgggc 


ggggcgacct 


attcgctgaa 


gctgctgtcg 


240 


gggttcaagt 


cgcggctgga 


ccgccggttc 


gagcagttcc 


tgaacggcga 


cctaaaggtg 


300 


cacaaggtcc 


tgacccatcc 


cgacgatctg 


gcggtctatg 


cgctggcgtc 


gcttctgcac 


360 


gacaagccgc 


cggggaccgc 


cgcgatgccg 


ggcatcggcg 


cgatgcacca 


cctgccgcga 


420 


ccgggtgagc 


tgggcagccg gacggagctg 


cccatcggcg 


cgggcatggg 


gtcgtctgcg 


480 


gccatcgtcg 


cggccaccac ggtcctgttc 


gagacgctgc 


tggaccggcc 


caagacgccc 


540 
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gagcggttga agcacggcaa 


ggccggtccc 


600 


atcgacgcgg 


ccagcgtcgt 


gcgcggcggg 


cttgtccgcg 


tgggcgggaa 


cgggccgggt 


660 


tcgatcagca 


gcttcgattt 


gcccgaggat 


cacgaccttg 


tcgcgggacg 


cggctggtac 


720 


tgggtactgc 


acgggcgccc 


cgtcagcggg 


accggcgaat 


gcgtcagcgc 


ggtcgcggcg 


780 


gcgcatggtc 


gcgatgcggc 


gctgtgggac 


gccttcgcag 


tctgcacccg 


cgcgttggag 


840 


gccgcgctgc 


tgtctggggg 


cagccccgac 


gccgccatca 


ccgagaacca 


gcgcctgctg 


900 


gaacgcatcg 


gcgtcgtgcc 


ggcagcgacg 


caggccctcg 


tggcccagat 


cgaggaggcg 


960 


aataacacaa 
yy *-yy *-y*-yy 


ccaaaatcta 


caacacaaat 


tccgtgcggg gcgatcacgg 


*-yyyy*-yy 


1020 


ctcgtgcgga 


ttgacgacgc 


gcaggcgatg 


gcttcggtca 


tggcgcgcca 


tcccgacctc 


1080 


gactgggcgc 


ccctgcgcat 


gtcgcgcacg 


ggggcggcac 


ccggccccgc 


gccgcgtgcg 


1140 


caaccgctgc 


cggggcaggg 


ctga 








1164 


<210> 33 
<211> 1164 

<212> DNA 

<213> Artificial sequence 










<220> 

<223> Paracoccus zeaxanthinifaciens I17T, G47D, K93E, 


P132S 




<400> 33 
atgagaggat 


cgcatcacca 


tcaccatcac 


tcgaccggca 


ggcctgaagc 


aggcgcccat 


60 


gccccgggca 


agctgaccct 


gtccggggaa 


cattccgtgc 


tctatggtgc 


gcccgcgctt 


120 


gccatggcca 


tcgcccgcta 


taccgaggtg 


tggttcacgc 


cgcttgacat 


tggcgagggg 


180 


atacgcacga 


cattcgccaa 


tctctcgggc 


ggggcgacct 


attcgctgaa 


gctgctgtcg 


240 


gggttcaagt 


cgcggctgga 


ccgccggttc 


gagcagttcc 


tgaacggcga 


cctaaaggtg 


300 


cacgaggtcc 


tgacccatcc 


cgacgatctg 


gcggtctatg 


cgctggcgtc 


gcttctgcac 


360 


gacaagccgc 


cggggaccgc 


cgcgatgccg 


ggcatcggcg 


cgatgcacca 


cctgccgcga 


420 


tccggtgagc 


tgggcagccg 


gacggagctg 


cccatcggcg 


cgggcatggg 


gtcgtctgcg 


480 


gccatcgtcg 


cggccaccac 


ggtcctgttc 


gagacgctgc 


tggaccggcc 


caagacgccc 


540 


gaacagcgct 


tcgaccgcgt 


ccgcttctgc 


gagcggttga 


agcacggcaa 


ggccggtccc 


600 


atcgacgcgg 


ccagcgtcgt 


gcgcggcggg 


cttgtccgcg 


tgggcgggaa 


cgggccgggt 


660 


tcgatcagca 


gcttcgattt 


gcccgaggat 


cacgaccttg 


tcgcgggacg 


cggctggtac 


720 


tgggtactgc 


acgggcgccc 


cgtcagcggg 


accggcgaat 


gcgtcagcgc 


aatcacaaca 

yy *-*-y *-yy *-y 


780 


gcgcatggtc 


gcgatgcggc 


gctgtgggac 


gccttcgcag 


tctgcacccg 


cgcgttggag 


840 


gccgcgctgc 


tgtctggggg 


cagccccgac 


gccgccatca 


ccgagaacca 


gcgcctgctg 


900 


gaacgcatcg 


gcgtcgtgcc 


ggcagcgacg 


caggccctcg 
Page 


tggcccagat 
39 


cgaggaggcg 


960 
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ggtggcgcgg ccaagatctg cggcgcaggt tccgtgcggg gcgatcacgg cggggcggtc 1020 

9 ctcgtgcgga ttgacgacgc gcaggcgatg gcttcggtca tggcgcgcca tcccgacctc 1080 

gactgggcgc ccctgcgcat gtcgcgcacg ggggcggcac ccggccccgc gccgcgtgcg 1140 

caaccgctgc cggggcaggg ctga 1164 



Page 40 



